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Abstract. Polytopes from subgraph statistics are important in applications and con- 
jectures and theorems in extremal graph theory can be stated as properties of them. We 
have studied them with a view towards applications by inscribing large explicit polytopes 
and semi-algebraic sets when the facet descriptions are intractable. The semi- algebraic 
sets called curvy zonotopes are introduced and studied using graph limits. From both 
volume calculations and algebraic descriptions we find several interesting conjectures. 

1. Introduction 

In this paper we study polytopes from subgraph statistics. The vertices of these poly- 
topes are given by the relative proportions of subgraphs of different types. We got in- 
terested in studying the polytopes from subgraph statistics after several questions were 
raised about them by Rinaldo, Fienberg and Zhou [21\. They investigated maximum 
likelihood estimation for exponential random graph models and realized that its behavior 
is closely linked to the geometry of these polytopes. The subgraphs counted are usually 
determined by the applications of the model, and in the social sciences small graphs as 
stars and triangles are common [22], but we make no restrictions of that type. 

Our results and methods are from graph theory and discrete geometry, but we have 
made an effort to address directions that are important for applications. For example to 
understand when the polytopes have the expected dimensions and how to approximate 
them with similar polytopes or semi-algebraic sets with easy explicit descriptions when 
their facet structures are not attainable. 

On the track to these descriptions we have found several, for us, unexpected results 
and conjectures about both enumerative and geometric combinatorics. 

For the experts who wants to skip ahead, we want to clarify a notational difference 
between mathematical communities: Our focus is on finite graphs and finite dimensional 
polytopes close to applications, even if we use limits and infinite objects as technical tools 
in some of the later proofs. Therefore the t-function counts ordinary honest subgraphs and 
not graph homomorphisms, and these notions are crucially different in the finite setting 
before the limit. 

1.1. A short overview of the paper. In Section [2] we define the polytope from sub- 
graphs statistics and its lattice version. We explain basic properties of their facet struc- 
tures and how different polytopes and Ehrhart polynomials are related to each other. A 
complete facet description would solve many open problems in extremal graph theory, so 
to get any understanding of these polytopes we inscribe more well-studied polytopes and 
semi-algebraic sets in them. 
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The spine of our polytopes is defined in Section [3j The convex hull of a finite number 
of points on the spine is a cyclic polytope inscribed in the polytope of subgraph statistics, 



and the convex hull of the spine is a semi-algebraic set. In Section 3.1 we calculate the 
volumes of the inscribed sets and find both explicit formulas and integrals connected to 
the Selberg integral formula. The dual of the convex hull of the spine and the polytope 
provides a method to find certificates that polynomials are non-negative. 

The convex hull of the spine never fills up all of the polytope, and can even be of lower 
dimensions. In Section [4] we introduce the curvy zonotopes. They are semi-algebraic sets 
with explicit descriptions that are of the top dimension. Alternatively the spine could have 
been defined as the expected values of graphs from the Erdos-Renyi graph model with 
different model parameters. We get the curvy zonotopes as expected values of particular 
exchangeable graph models. 

In Section [5] we show that the curvy zonotopes are not only of the right dimension, but 
that they can be chosen to get the volume arbitrary close to the polytope that they are 
inscribed in. The proofs relies on the theory of graph limits and Szemeredi regularity. 
Finally in Section [6] we study the limit case of counting complete subgraphs and conjecture 
that the limit objects essentially are cyclic polytopes with infinite many vertices. 

2. Basic properties of polytopes of subgraph statistics 

In this section we give proper definitions of the objects of our study, and provide some 
first results to describe them. We follow standard notation in graph theory, as in for 
example Diestel [7j. 

The number of subgraphs of G isomorphic to F is t L (F, G). Later the letter L indicates 
that we work with lattice polytopes. The F-subgraph density in G is defined as 

HFC\- tL(F ' G) 

except when F has more vertices than G, and then it is 0. 

Definition 2.1. Let F = (F 1} . . . , Fj) be a vector of graphs and G a graph. The vector 
valued subgraph statistics are 

t(F,G) = (t(F 1 ,G),...,t{F d ,G)) 

and 

t L (F,G) = (t L (F 1 ,G),...,t L (F d ,G)). 

Definition 2.2. Let F be a vector of graphs and n a positive integer. The polytope from 
subgraph statistics Pp ;n and its lattice version Pp. n are defined as 

Pp;n — conv {t(F, G) | G is a graph on n vertices} 

and 

Pp. n = conv {t L (F, G) | G is a graph on n vertices}. 

Example 2.3. The polytope P(K 3 ,c 4 ,K 4 \e);6 is drawn in Figure [TJ and in Figure [2] is a 
combinatorial representation of its vertices and edges. 
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If larger examples of polytopes from subgraph statistics looks anything like in Figures [T] 
and [2| then it will be very difficult to give an explicit facet description. And indeed many 
hard theorems and conjectures in extremal graph theory can be rephrased as questions 
about these polytopes, making a complete facet description probably impossible in gen- 
eral. In Figure [2] we tabulated the vertices by the actual subgraph counts and not the 
proportions t(F,G). This defines the lattice polytope Pp. n , a rescaling of Py-u- Several 




Figure 1. The polytope P(K 3 ,c 4 ,K A \e)-fi- 



(8,10,16) 




Figure 2. A combinatorial representation of the vertices 
P(^ 3) c 4 ,_ft' 4 \e);6, indexed by the actual subgraph counts. 



and edges of 
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graphs could have the same subgraph statistics, and even if t(F, Gi) and t(F, G 2 ) are 
different vertices on the same facet, it is not necessary that G\ and G<i are related in any 
sense, for example as subgraphs. This is illustrated in Figure [3] 




(16,33,60) 



(0,0,0) 



00 

and all forests 

Figure 3. The graphs underlying the statistics of a piece of the polytope 
in Figures [T] and [2] Dotted graph edges could be included or not. Recall 
that the subgraphs counted are (K 3 , C 4 , K± \ e). 



Before embarking on more general results about the polytopes from subgraph statistics, 
we point out some easy facts. 

Proposition 2.4. Let F = (F\, . . . , Fd) be a vector of graphs with edges of order at least 
n, none of them a subgraph of another. Then the inequalities x% > are facet defining for 

PF;n- 

Proof. The number of subgraphs is non-negative and the inequality Xi > always defines 
a face, the question is if it is a facet. Let Gj be the graph given by the disjoint union of 
Fj and enough isolated vertices to get n vertices in total. The vector t(F,Gj) is non-zero 
exactly in the component j. On the hyperplane defined by x,- t = is a dimension d — 1 
simplex defined by the points and t(F, Gj) for i ^ j. □ 

Lemma 2.5. If F and G are graphs then t L (F, G) = j^h(}$)t(F, G). 
Proof. If t(F, G) = t L {F,G) = it is true. Otherwise use that t L {F, K\ G \) 



m (\g\ 



Aut(-F) \\F\ 



\\F\J- 



□ 



Lemma 2.6. If F and G are graphs and n an integer then 



V 117 U€l V ' a >) 
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Proof. If not \F\ < n < |G| then both sides are zero. Otherwise, consider a particular 
copy of F in G. On the right hand side it is counted for every subset U of size n containing 
the vertex set of that copy of F. The set U contains n — \F\ elements to be chosen among 
the \G\ — \F\ vertices outside that particular copy of F. □ 

Proposition 2.7. Let F be a vector of graphs of order at most n. If n < n' < n" then 

Pp-,n' 3 pF;n"- 

Proof. Let G be a graph on n" vertices and let F be any graph in F. First combine 
Lemma 2J3 and Lemma |2.6 to get 



(\G\\ (\G\-\F\\ 
\\F\)\n'-\F\) Ue (V(?)) 



t(F, G) = fm )S_ m £ WMW)- 



By expanding and simplifying, one gets that (\g\\}\g\-\f\\ — ('^') • Collected into vectors 

\\F\){n'-\F\) 



this is 

-l 



t(F,G)=(^j) Yl ^G\U]) 



and all vertices of P-py are in simplices spanned by points in Py-u 1 - D 

For any number k, a polytope P can be inflated to kP = {kp\p e P}. For a lattice 
polytope P and positive integers k, the number of lattice points in kP is the Ehrhart 
polynomial Ep(k). We refer to chapter 12 of Miller and Sturmfels |T7] for a proof of 
this, and a description of the connections to algebraic geometry. This is a translation of 



Proposition 2.7 into the lattice polytope setting. 



Proposition 2.8. Let F = (F\, . . . , Fd) be a vector of graphs of order n. Then 



t,, L [ [ n ')k] < /•;,,/ ( ("" ]/, 



z/n" >n'>n. 

Proof. Define a linear rescaling map Lp : M. d — > M> d by Lp(ej) = Aut(Fi) _1 ej. By 



Lemma 2.5 



L P ( n! ( " ) P F ;„» ) = P#. n „ and L P n! P F;n ' = Pp> 



n / / \ \ n 



According to Proposition 2.7 there is an inclusion of polytopes Pp; n » C Pp ;n / since n" > n' . 



We rescale the inclusion to 



n'\ ( . / n" \ „ \ f n" \ / .in' 



n\[ P F; „» C n! Pp ; ^ 

n j \ \ n J ) \n J \ \n J 

apply the linear rescaling map and move out some scalar factors, 
n I \ \ n } } \ n I \ \ n 



or equivalent ly, ( n )Pp. n » Q (^ 1 )P F n / - Counting lattice points gives the desired result. □ 
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In the proposition it is required that all graphs in F are of the same order, and this can 
partially be generalized by adding isolated vertices to get graphs of the same order. 

Example 2.9. If F is the graph vector of the path on three vertices and the triangle, 
then 

and 

E p l 3 U^j k^j = 50k 2 + 15k + 1 > 48k 2 + 13k + 1 = E p l & U^j k 

3. The spine of polytopes 

Since it's hard to understand the polytopes exactly, we inscribe more accessible poly- 
topes and semi-algebraic sets within them. For a vector F of m graphs, the spine is the 
generalized moment curve 

{(p e \p e \...,p e ™)\0<p<l}, 

where e\ is the number of edges in Fj. A graph G from the Erdos-Renyi random graph 
model Q(n,p) have n vertices, and edges are included independently with probability p. 

Proposition 3.1. Let F = (F\, . . . , Fd) be a vector of graphs of order at most n, then the 
spine {(p ei ,p e2 , ■ ■ ■ ,p £d ) | < p < 1} is in Pf;u- 

Proof. The expected value of t(F, G) with G from Q(n,p) is p e if F have e edges, since the 
edges of G are included independently with probability p. For every instance of graphs G 
from Q(n,p) the vector t(F, G) is a point in Pf ; „. The expected value of t(F, G), 

(p e \p e \...,p e ™), 

is then also a point in Pf;u- D 
In Figure [4] is the polytope P(K 3 ,c 4 ,K 4 \e);6) from Figure [T] drawn with its spine. The 



point of Proposition 3.1 is that the spine is a generalized moment curve inside Pf^u- The 
convex hull of a finite number of points on the spine is a cyclic polytope. This can be 
seen directly by using generalized van der Monde matrices instead of the ordinary one in 
Ziegler's textbook derivation of the combinatorial structure of cyclic polytopes [28]. This 
shows that there are cyclic polytope inscribed in Pf-u- 

The convex hull of all of the spine is not a polytope, but its boundary can be alge- 
braically described. In Figure [5] is the spine from Figure [4] drawn with its convex hull. 
Since the boundary structure of the convex hull in Figure [5] is not very clear from this 
angle, we include in Figure [6] the same spine with its convex hull, but from another 
perspective. 

3.1. Volumes. Inside our polytopes we have convex hulls of generalized moment curves 
and their volumes bound the volumes of polytopes from subgraph statistics. For the 
ordinary moment curve {(p,p 2 , ■ ■ ■ ,p d ) \ < p < 1} the volume of its convex hull was 
calculated by Karlin and Shapley [Hj. To calculate the volume of the convex hull of the 
spine, and for later applications, we need the following standard approximation. 
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Figure 4. The polytope in Figure [T] with its spine. 




Figure 5. The spine in Figure [4] drawn with its convex hull. 

Lemma 3.2. Let A C B C [0, l] d be sets so that for any b G B there is an a G A with 
\a — b\ < £ ; t/ien 

Vol(convB) — Vol(convA) < k^e 
where kj only depends on the dimension d. 

Proof. For X = A, B and £ G R d set 

F x (0 = sup(x,£) = sup (£,£)• 

x£X xSconvX 
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Figure 6. The spine with its convex hull from Figure [5] drawn from another perspective. 

By assumption, there is always a point within distance e in A from each point in B, so 
Hb(0 — £ < Ha(C) < Hb(0 for all £. Using the setup of Hormander [IT], section 4.3, the 
function H c (0 = H B (£) - £ defines a convex body C = {x \ (x,g) < H c (£),£ G R d } C 
convA C convi? with 

Vol(con Vj B) - Vol(C) < eS B 
where S B is the surface area of the closed convex hull of B. The surface area can be 
estimated from above with a kd since the volume of B is at most one, for example as in 
Section 1.8.3 of [3] . □ 

Proposition 3.3. Let e\ < e<i < ■ ■ ■ < e^, and n be positive integers. Consider the cyclic 
polytope P on the vertices 

{(x ei ,x e2 ,. . . ,x Cd ) | x = i/n, i = 0,1,2, ... ,n} 

and the convex hull C of the moment curve 

{(x ei ,x e2 ,...,x ed ) I < x < 1}. 

Then 

< Vol(C) - Vol(P) < — (ei + e 2 + • • • + e d ), 

n 

where kd is a constant depending only on the dimension d. 



Proof. Let / : [0, 1] — > IR rf be x i— > (x ei ,x e2 , . . . , x £d ). From the vector valued mean value 
theorem 

\f(y) - f(x)\ < \y -x\ sup \f'(y + t(x -y))\ 

0<t<l 

applied with y = [xn\/n and \f'(y + t(y — x))\ < |/'(1)| — e i + e 2 + • • • + e d , we get that 
any point on the moment curve is within distance 

ei + e 2 H Ye d 

£ = 

n 
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of a vertex of the polytope P. Now use Lemma 3.2 □ 



We treat Schur polynomials as they are defined by fractions of generalized Vandermonde 
matrices instead of using representation theory. For these elementary facts we refer to 
Sagan [23] • We will also use the Pfaffian, and refer to [HJ [TB] for definitions and basic 
identities. 

Theorem 3.4. Let m > 1 and let F = (Fi, . . . , F 2m ) be a vector of 2m distinct graphs 
with edges. The volume of the convex hull of the 2m -dimensional spine ofF 

Vol(conv{(p ei ,p e2 ,...,p e2m ) | <p< 1}) 



is 



(2m)! 



. S\(x±, X\, X2i X 2 , ■ ■ ■ ; %mj I I i%i x j) dxi " " " dx m , 

>A\m\ Jrniim - LJ - 

' J ^ U ' X J 0<i<j<m 



where Aj + 2m — i = ti in the Schur polynomial S\, and it is assumed that all e« are 
different and greater than 0. 

And in the odd dimensional case F = (Fi, . . . , F 2m+1 ) we get 



(2m + 



\ x , , I S x (x 1 ,x 1 ,x 2 ,x 2 ,...,x m ,x m ,l) TT (xi-xj) 4 TT (l-Xi) 2 dxi ■ ■ ■ dx 

- ly.ml hn ilm 



where Aj + 2m + 1 — z = in i/ie Schur polynomial S\, and it is assumed that all C{ are 
different and greater than 0. 

Proof. The idea of the proof is to approximate the volume of the convex hull of the spines 
with the volume of cyclic polytopes. The cyclic polytopes used to approximate the volume 
have their vertices placed uniformly distributed on the spine. When the number of vertices 
of these polytopes go to infinity the volume of the polytopes converge to the volume of 
the convex hull of the spine. 

To compute the volume of a cyclic polytope P we begin by triangulating the polytope 
P in a special way. The volume of the polytope P is then computed by summing the 
volumes of the simplexes in the triangulation. The triangulation used is constructed as 
follows: Pick a vertex v in P, the triangulation is the one consisting of all the simplexes 
spanned by a facet of P together with v. This is a triangulation of P since cyclic polytopes 
are simplicial. Note that the facets containing v do not contribute to the volume and so 
they can be ignored. 

Consider the cyclic polytope P n spanned by {((i/n) ei , (i/n) e2 , . . . , (i/n) e2m ) | < i < 
n}. The polytope P n has vertices uniformly distributed on the spine of F. When n goes to 
infinity the volume of P n converge to the volume of the convex hull of the spine. Choose 
the vertex v to be 0. By Gale's evenness condition [28] the volume of P n is 



m ) ! ^ ^ -( n ) ( n / 



(2m)! 

ti,.. 



l<j<2m 
Kk<m 



where the sequences summed over are those satisfying ij + 1 < ij+i, < %\ and i m < n. 
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The determinants can be expressed in terms of Schur polynomials and so the volume 
of the polytope P n is 



1 



(2m)! 



l\ l\ + 1 

n ' n 



"m in 
n 



n 



n' 



n 



l<j <k<rn 



where the sum is over the same sequences as in the formula with determinants. This is a 
Riemann sum and so when n go to infinity this converges to the integral 

\T / S\(x\, x±, . . . , x m , x m ) | | [xi Xj) dxi ■ ■ ■ dx m , 



(2m) 



l<j<j'<m 



where I = {0 < x\ < x 2 < ■ ■ ■ < x m < 1}. Both the Schur polynomial and the product 
of four-powers of differences are symmetric polynomials, and we can replace I by [0, l] m 
and divide by a factor m! to get the integral stated in the theorem. 



The integral is the volume of the convex hull of the spine according to Proposition 3.3 



The odd dimensional case is similar, but every full dimensional simplex contains the 
vertex 1 since the facets of a cyclic polytope in odd dimensions all contain or 1. □ 



In the even dimensional case there is a nice formula for the integral in Theorem 3.4 
Theorem 3.5. Let A = (Ai, . . . , \2m) in the Schur polynomial S\, then 



■^I ' 1 ]" 1 0<i<j<m l<i<i< 



X<i<j<2m 



Vi - Vj 
Vi + Vj 



where yi = 2m + Aj — i. 

In the odd dimensional case, let A = (Ai, . . . , A2 m +i) in the Schur polynomial S\. Then 



S x (x 1 ,xi,x 2 ,x 2 , • • • ,x m ,x m , 1) ] | (xi-XjY J [ {l-Xi) 2 dx = m! 

I ' 1 !" 1 0<i<j<m 0<i<m 

where yi = 2m + 1 + A» — i. 



TT Vi - Vi 

Vi + Vj ' 

l<i<j<2m+l t>l V- 1 



Proof. First we prove it in the even dimensional case. The first step of the proof is to go 
back to expressing the integrand as a limit of a determinant. This limit is also a nice deter- 
minant that is possible to integrate. Observe that S\(xi, x±, x 2 , x 2 , . . . , x m , x rn ) Yii 



) 4 is the limit of 



<i<j<m\ x 3 



det \x 



2m—j+Xj-\ 



|l<M<2m 



det \x 



2m-Jl 



Jl<*J<2m 



det^ J+ J ]l<i,j<2m 



det \x 



2m— ji 



\l<i,3<2m 



Yli=l( X 2i - %2i-l) 



n^Li(^2i 



X 2 i-1, 



as x 2i _i x 2i . Row operations on the matrix [x 2m ^ + J ]i t j=i 1 ...,2m don't change the deter- 
minant, and subtraction of row 2i — 1 from 2i for % — 1, 2, . . . , m yields 



det 



x 



2m—j+Xj 



2 1 



X 



2m—j+\j 



X 



2m—j+\j 
2i-l 



2i-l 



1 < i < m 
Kj<2m 



U7=l( X 2i ~ X 2i -l] 



det 



x 



2m—j+\j 
2% 



~ x 2i-l )l\ X 2i -X 2i -l) 
2m-l-j+\j 
dj 2i-\ 



1 <i <m 
Kj<2m 
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Using L'Hopital's rule as Xn-\ — > x 2 % for i = 1, . . . m, and then relabeling x 2 i — ► we 
get that the integrand in the theorem statement equals 



det 



(2m — j + Xj)x 

2m—j+Xj 



2m—l—j+\j 



X: 



l<z<m 
l<j<2m 



This determinant is a sum over the symmetric group S; 



^2 £ ^ Y[( 2m ~ + K(2i))x 

7rGS 2m *=1 



2in 



2m-l— 7r(2i)+A 7r(2 j) 2m - 7r ( 2 *- 1 )+ A 7r (2 I -i) 



where e(7r) is the sign of n. If we require that n(2j) > 7i(2j — 1) for all j then the integrand 
becomes 

m 

£ W II (((2m - 7r(2i) + A^)) - (2m - 7r(2i - 1) + A^!))) 

7re5 2 m, 7T(2j)>7r(2j-l) 



1=1 



2m-l-7r(2i)+A w(2l) 2m-7r(2i-l)+A 7r(2i _ 1) 



and with the notation = 2m + Aj — i it equals 



i-i 



7re5 2 m, 7r(2j)>vr(2j-l) 1=1 

Integrating over each separately gives the following value of the integral in the theorem: 

y-w{2i) - yn(2i-i) 



e 7r 



y7r(2i) + 2/ir(2»-l) 



7re5 2 m, 7r(2j)>7r(2j-l) 

The sum is actually the Pfaffian of an anti symmetric matrix, in general for A = [aij]ij=i J ...,zm 
anti symmetric 

Pf ^ = mT £(7r)n a T(2i-i) j7 r(2i). 

' vreS 2m , 7r(2j)>7r(2j-l) 8=1 

Hence 



S\(xi,X 1 ,X 2 ,X 2 , ■ ■ ■ ,X m ,X m ) ] [ ( 
P. 1 ]™ 0<i<i<m 



Xi — Xj) A dx = m!Pf 



Vi + Vj 



but we know that in general this Pfaffian evaluates to the stated product formula, a result 
usually attributed to Schur pMj . 

The first odd case is true. For higher odd, start off as in the even but then expand the 
matrix along its column with only ones. This gives a sum of instances from the smaller 
even dimensional case, and by a degree argument the formula follows. □ 

Corollary 3.6. Let F be a vector of k distinct graphs with edge counts e\ > ■ ■ ■ > > 0. 
Then the volume of the convex hull of the k- dimensional spine ofF is 

Vol(conv{(p ei ,p e2 ,...,p efc ) | <p< 1}) = — Yl 

rv. 



C{ Cj 



l<i<j<k 



Gi ~\~ Cj 
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Special cases of this integral have been computed before. Karlin and Shapley [H] used 
the Selberg integral formula [TQI to do the case of ej = i, but they arrived to the 
integral in a completely different way. Selberg's formula can only be applied in the cases 
where the edges are of consecutive magnitudes: = e + i. 

3.2. Duality. As for polytopes there is a duality theory for convex hulls of algebraic 
sets [3]. The dual of the convex hull of the moment curve {(p, p 2 , . . . ,p n ) | < p < 1} 
parametrizes the degree n polynomials that are non-negative on the interval [0,1]. The 
convex hulls of generalized moment curves are inside polytopes from subgraphs statistics, 
so the polytopes can be used to certify that polynomials are non-negative. 

Proposition 3.7. Let P be a polytope containing the generalized moment curve {{p ei ,p £2 , ■ ■ 
< p < 1}. If ((ci, C2, . . . , c d ), v) > —1 for all vertices v of P then the polynomial 
q(x) = 1 + C\X ei + C2X £2 + • • • + CdX ed is non-negative on the interval [0, 1] . 

Proof. The value of the polynomial q(x) = l+CiX ei +C2X e2 + - ■ -+CdX ed at the point x = p is 
l+((p ei ,p £2 , ■ ■ ■ ,p ed ), (ci, c 2 , . . . , c d )). This implies that if ((p ei ,p £2 , . . . ,p £d ), (c lt c 2) . . . , c d )) 
— 1 for all p in [0, 1] then the polynomial q is positive on [0, 1]. □ 

Example 3.8. Our running example P(K 3 ,C4„K4\eyfi in Figure [T] is perhaps not the most 
interesting polytope to certify non-negativity with, but we use it in an example anyways. 
The polynomial p(x) = 1— yi 3 + yx 4 - \x b is non-negative on [0,1] since (— y, > 
— 1 for all vertices v of P(K 3 ,c 4 ,K 4 \ey,6- Note that the point (— y, y, — |) is dual to the 
facet with vertices (8/20,10/45,16/90), (10/20,15/45,30/90), (5/20,3/45,6/90), which 
one can find using Figure [2} 



4. Curvy zonotopes 

In this section we generalize the ideas used to construct spines. A very general class 
of random graph models called exchangeable random graph models is used instead of 
the Erdos-Renyi model, but the idea is the same: The expectation values from random 
graph models give easily parameterized sets inside the polytopes of subgraph statistics, 
and these sets are useful to derive information about the polytopes. The exchangeable 
random graph models are related to the graph limits developed by Lovasz and Szegedy 
[15*] . as explained by Diaconis and Janson [6]. The machinery of graph limits is used more 
heavily in the next section where more of it is explained. In this section only the random 
graph models are needed. 

The exchangeable random graph models is a generalization of the Erdos-Renyi random 
graph model Q(n,p), obtained by replacing p by a measurable and symmetric function 
W : [0, l] 2 — ?• [0, 1]. Denote the set of these measurable symmetric function by W. 

Definition 4.1. Let W be a function in W. A graph G from the exchangeable random 
graph model Q(n,W) is given as follows. The vertex set of G is [n]. Let X\, . . . ,X n 
be independent random variables with uniform distribution on [0,1]. For every pair 
1 < i < j < n add an edge ij to G with probability W(Xi, Xj). 
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Note that for given X 1; . . . ,X n the probability to get a graph with edge set E from 
g(n,W) is 

nwix^xj) n (l-w^xA). 

ij&E ij£E(K n )\E 

We collect some basic facts from [T5]. To proceed further the expectation values of 
subgraph densities is computed for the exchangeable graph models. 

Proposition 4.2. Let F be a graph with vertex set [m] and let W G W. The expected 
value of t(F, G) when G is from Q(n, W) is 



E(t(F,G))= TT W(xi,Xj)dxi---dx r , 



'[0,1]"- ijeE{F) 



where it is assumed that m < n 



We use the notation t(F, W) = f\ 1 ^ m Ylij^E(F) W(xi, Xj)dxi ■ ■ ■ dx m for the expectation 
values calculated in Proposition 4.2 For a vector of graphs F = (F\, . . . ,F d ) we define 
t(F,W) = (t(F 1 ,W),...,t(F d ,W)). 

In general it might be hard to compute t(F, W) explicitly. 

Definition 4.3. A W G W is a stepfunction, if for some partition of [0, 1] into intervals, 
the value of W(x, y) is completely determined by which parts of [0, 1] that x and y belongs 
to. 

For any symmetric n xn matrix M with entries in [0,1], the stepfunction Wm takes 
the value of M at row i and column j at [(i — l)/n,i/n) x [(j — l)/n,j/n). 

Note that the value of Wm is not defined on the entire boundary of [0, l] 2 , this do not 
matter for our purposes since the boundary have measure zero. 

The stepfunctions are useful since any function in W can be approximated by a step- 
function, and for stepfunctions W it is possible to establish a polynomial expression for 
t(F,W). 

Proposition 4.4. Let M = [ 

TTii,j]nxn be a symmetric matrix with entries in [0, 1] and let 
F be a vector of graphs. Then 

t(F,W M ) = (p Fl]n (M), P FrAM),...,PF d AM)), 

where 

p F ;n(M) = -j^ Yl II m W)m- 

cf>:V(F)->[n] ijeE(F) 

Proof. Let F be a graph with vertex set [A;]. Recall that the expectation value t(F, Wm) 



is 



/ n ^K^-ds, 



I ' 1 ! ijeE(F) 

Partition [0, l] h into the parts — l)/n, ii/n) x [(i^ — l)/n, fa/n) x ■ ■ • x \{i k — l)/n, ife/n), 
the boundary have measure zero and can be ignored. Observe that there is a bijection 
between the functions (f) : V(F) — > [n] and the parts in the partition of [0, l] k . The 
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function corresponds to the part [(0(1) — l)/n, 0(l)/n) x [(0(2) — l)/n, 0(2)/n) x ■ ■ ■ x 
l)/n,0(A;)/n). 

The function riyesfF) ^M( x h x j) is constant n^eBf-F) m <f>{i),<f>(j) on ^ ne P ar ^ correspond- 
ing to 0. The integral of YlijeE(F) WM{%u%j) over the part corresponding to is then 

^Jf\ HijeE(F) 171 W), 

The integral over [0, l] fc then splits into the sum 

f k n w M(x i ,x j )dx 1 - ■ -dx k = ^ ^2 n 



t ' 1 ^ ijeE(F) 4>iV(F)-*[n] ijeE(F) 



□ 



The polynomials pF- n (M) in Proposition 4.4 can be computed and evaluated, this makes 
it possible to study their image as a subset of the polytopes. 

Definition 4.5. Let F = (F\, . . . , Fd) be a vector of graphs and n a positive integer. The 
curvy zonotope is 

Z F ;n = |(pF i; n(x),PF 2 ;n(x), . . . ,p Fd;n (x)) | X £ [0, l]™',^ = XjA 

where ^ 

^«( x ) = ^jf\ yi n x *w^o> 

</>:V(F)->[n] ijeE(F) 

The curvy zonotope -Z'(x 3 ,c*4,x 4 \e);2 is drawn in Figure [7j 



Figure 7. The curvy zonotope Z^ Kzj c^K A \ey,2- As defined in Definition 



4.5 



The curvy zonotope Zy-^ consists of the expected values from random graphs models 
according to Proposition 4.4, and this implies that it is a subset of the polytopes Py-tu- 
The curvy zonotopes contain the spines as the spines come from the special case of the 



POLYTOPES FROM SUBGRAPH STATISTICS 



15 



function W being constant p. The curvy zonotopes have many desirable properties that 
the spine do not, for example full dimensionality. 

Theorem 4.6. Let F = (F\, . . . , Fd) be a vector of d distinct graphs with no isolated 
vertices and with no > max{|Fi|, . . . , \Fa\). If n > n then the curvy zonotope Zp. n is not 
contained in a hyperplane in M. d . 

Proof. The curvy zonotope is defined in terms of polynomials coming from graphs. The 
idea of the proof is to identify a term in the polynomials with a special exponent. This 
special exponent ensure that no linear combination of the polynomials can be constant 
and this proves that the curvy zonotope is not contained in a hyperplane. 
Let F be a graph in F. The graph F give a polynomial 

4>:V(F)->[n] ijeE(F) 

as in the definition of the curvy zonotope Z-$. n . 

Assume that the curvy zonotope is in the hyperplane defined by Ci^ip^^x) + • • • + 
C\F d \PF d -n(x) = C. Let F be a graph with maximal order among the graphs Fi with 
C„ / ()'• 

Let : V(F) — > [n] be injective. The function give a term YlijeE(F) x <t>(i)<f>U) * n t ne 
polynomial p^ ; „(x). That is injective implies that the graph F', with the indices occur- 
ring in Y\ ij€E(F) X4>(i)<f>(j) as vertex set and the pairs of indices occurring in H ijeE{F) x<t>(i)<f>(j) 
as edge set, is isomorphic to F. 

Let F' be a graph with Cf' ^ and let pF'-n contain riije-B(F) x <P( i )4'U)- ^ ne f unc ti° n 
was injective from V(F) and so |F'| must be at least as large as \F\, but |F| was maximal 
and hence |F| = The function 0' : V(F') ->■ [n] giving the term I\ ije E(F) x Hi)4>U) 
is then injective. But it is possible to reconstruct F from Y\ijeE(F) x 4>(i)4>U)^ assum i n g 
that the function giving the term is injective. This implies that no other polynomial from 
a graph F 1 with Cp' contains the term YlijeE(F) x <t>( i )'l > U)- This ensure that no term can 
cancel the unique term C f Jf\ Yl ije E(F) x <f>(i)4>U) in tlie polynomial C| Fl |p Fi;n (x) + ••• + 
C\F d \PF d ;ni?^)i and so the polynomial can not be constant. This is a contradiction and the 
curvy zonotope is not contained in a hyperplane. 

□ 

The curvy zonotope is not in a hyperplane, this means that there are points in it that 
span ci-dimensional simplex and then all the polytopes Pf- h contain this simplex and is 
full dimensional. This establishes the following corollary: 

Corollary 4.7. Let F = (Fi,...,Fd) be a vector of d distinct graphs with no isolated 
vertices and with n > max{|Fi|, . . . , \Fa\}. There is a constant Vf > such that that the 
volume of P-p-n is larger than Vp, for n > n . 

The volume of the polytopes do decrease with increasing n. But it is not the case that 
the volume can become arbitrarily small. 
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Figure 8. 



5. Limit object 



Recall that the polytopes of subgraph statistics satisfy the inclusion Pp n C Pf,ti when 
n<n. This makes it possible to define a limit object. 

Definition 5.1. Let F be a vector of graphs where no entry have more than no vertices. 
Define the limit object 



Describing the polytopes is very hard for most n, but for very large n the polytopes 
should essentially look like the limit object. Finding good descriptions of the limit objects 
and their properties is the goal of this section. As it turns out, the object are closely related 
to graph limits and the exchangeable random graph models. 

It seems almost hopeless to get a simple description of the polytopes Pp ; „ for any 
nontrivial vector F and integer n. For the limit objects the situation is much better. 
Known results in extremal graph theory can be used to get good descriptions of the limit 



Razborov found in [19] the exact minimal possible triangle density for given edge den- 
sity, when the number of vertices in the graph go to infinity. Razborov used machinery 
called flag algebras that he developed in [20] which is related to graph limits. If T(jp) is 
the minimal possible triangle density for edge density p, then {(p, T(p)) | p e [0, 1]} is a 
curve in the limit object P(k 2 ,k 3 ),oo- The convex hull of the curve {(p,T(p)) | p e [0, 1]} 
is the limit object P(k 2 ,k 3 ),oo- The convex hull of the curve has a nice explicit description: 



it's the hull of {(p, T{p)) \ pe [0,1]} is conv {(0, 0), (1, 1), (1 - §, 0), (1 -§,§),..., (1 - 



In this section the theory of graph limits is used to reinterpret the limit object, and to 
give a way to study it when an explicit description is not known. 




objects. 



1 (fc-l)(fc-2) 
fc> k 2 



),...}. 
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In [15] limits of certain sequences of graphs were defined. A sequence of graphs 
(Gi)i = i ; 2,... is said to converge if the sequence (t(F, Gi)) i=1> 2,... converges for every graph 
F. One of the important facts about convergent sequences of graphs is that they look like 
they come from exchangeable random graph models. One result in this direction from 
[15] is the following: 

Theorem 5.2 (Lovasz-Szegedy [IS]). If the sequence of graphs (Gi)i=i,2,... converges, then 
there is aW G W such that t(F, W) = lim^oo t(F, G^) for all graphs F. LetW G >V and 
let (Gj)i=i,2,... be a sequence of random graphs generated by the exchangeable random graph 
model Q(i, W), then (Gj) 1=1,2,... converges almost surely. In fact t(F, W) = lim^oo t(F, G») 
almost surely for all graphs F. 

The main ingredient in the proof is a method to pick a subsequence of (Gi)j=i,2,... 
with Szemeredi regularity properties. Using this subsequence it is possible to construct a 
suitable W. 

There is a useful norm on the space of functions [0, l] 2 — > R. 
Definition 5.3. The rectangle norm of a function W : [0, l] 2 — > R is defined by 



||W||n = sup / W(xi, X2)dxidx 2 - 

AC[0,1] 2 J A 

In the previous section we calculated with stepfunctions. In more general situations it 
might be hard to get exact results, but it is possible to approximate any measurable by 
a stepfunction. 

Lemma 5.4. For any vector F of d graphs with at most e edges, and any W G W, there 
is a stepfunction U with at most 2 kd£ steps (all of the same width) so that 

\t(F,W) -t(F,U)\ < dee. 

Here kd is a fixed constant depending on d. 

Proof. This is essentially a straightforward application of the weak Szemeredi regularity 
lemma contained in [8], as explained in [15]. There is a constant k, so that for any e > 
and W G W, there exists a stepfunction U (with all steps of the same width) and at most 
2 ke steps, so that \\U — W\\q < s. The rectangle norm used, relates to the fact, that, 
according to Lemma 4.1 of [15J, \t(F, U) - t(F, W)\ < \E(F)\\\U - W\\ a . □ 



One part of the proof of Theorem 5/2 uses the following very useful result. 

Lemma 5.5 (Lovasz-Szegedy |15]). Let (Gj)i>i be a sequence of graphs. Then there is 
an infinite subsequence (Gi)i>n e s an d a W G W so that 

lim t(F, d) 

i— »oo,i£S 

exists for all graphs F , and it equals t(F, W). 

The limit object, the intersection of polytopes of subgraph statistics, is related to the 
graph limits of Lovasz and Szegedy. The limit object is the convex hull of expectation 
values. 

Theorem 5.6. For every vector F of graphs, Pf ; oo = conv{t(F, W) \ W G W}. 
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Proof. Denote with Pp ; w the convex hull defined in the theorem statement. 

First we prove that any x G Pf ;00 is in Pf-,w- By definition Pf ;00 is the intersection 
of the d- dimensional polytopes Pf-u for n is larger than some Uq. By triangulating the 
interior of every Pp ;n using only boundary vertices, we can describe x in a simplex for 
every n > no '■ 

d 

X ^ ^ C^j,n^j,n 
j=0 

where each vertex of pF ; n, each aj. n > 0, and J2j=o a j< n = 1- ^ OT eYeT Y vertex in 

a Pf,u there is a graph realizing that subgraph statistics. Choose d + 1 graph sequences 



{GifM" so that t(F,G* n ) = v 



Use Lemma 575] on (G°) n>no to find an index set So an d a symmetric measurable 
function W : [0, l] 2 — > [0,1] so that lim n ^ oo ng 5 G° ) exists for all graphs F, and it 
equals t(F, W Q ). 



For j = 1,2, ... ,d repeat the use Lemma 5.5, but on (G^) ne 5 j _ 1 to find an index set 
Sj and a symmetric measurable function Wj : [0, l] 2 — > [0, 1] so that lim^oo jigs,. t(P, G 3 n ) 
exists for all graphs F, and it equals t(F, Wj). 

The points t(F, Wo), ■ ■ ■ ,t(F, Wj) spans (a possibly degenerate) simplex in Pf ; w an d 
we should locate x in it. The point (a 0) n,«i,n, • • • ,«d,n) is in a d-simplex for all n, and 
hence there is a subsequence S' C so that lim^oo ng s/ a,- )T , = a,-. 

Collecting the preceding limits, gives the desired 

d 

x = ^2& j t{F,W j ). 

j=0 

Now we prove the other direction. Pick a point x in Pf-w parametrized by 

d 

x = Y i a J t(F,W J ) 

j=0 

and assume that x G" Pf ; oo- Since Pf ;0 o is a closed convex set there is a hyperplane H 
strictly separating x and Pf ;0 o- By definition Pf ;0 o is the intersection of Pp;n ^ pF;n +i ^ 
PF;n +2 3 ■ ■ ■ for some n . 

Assume that all polytopes Pf-u intersect the hyperplane H, and pick a convergent 
sequence {xi}i =n0tno+ i^„ such that Xi G Pp ; i H P. The sequence {£i}i=no,n +i,... con- 
verge to something in H and the polytopes are compact, this implies that the limit of 
{xj} i=no no+ i v .. is in all polytopes Pp ; i. This is a contradiction, H will not intersect all 
Pf-u- For n\ large enough, the hyperplane H will not intersect PF;m, and x and Pf ; th are 
separated by P. 



Generate random graphs G J n G Q(n,Wj) for n > rii. According to Lemma 5.5 
lim^oo t(F, G J n ) exists and converges to t(F,Wj). Note that all of the points t(F,G 3 n ) 
are in the closed convex set Pf-ui, and hence so is all their limits t(F,Wj). But then 
x = Ylj=o otjt(F, Wj) should also be in Pp. ni , which contradicts that x and Pp ;ni are 
separated by a hyperplane. □ 



The curvy zonotopes can also be used to approximate the limit object arbitrary well. 
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k d2&2 

Theorem 5.7. Let F be a vector of d graphs on at most e edges. If e > and n > 2 d e 2 

then 

< Vol(P F; oo) - Vol(conv Z F . n ) < e. 
Here k^ is a constant depending only on d. 

Proof. Any point in the limit object Pf i00 is in the convex hull of 

{t(F, W) | W : [0, l] 2 -> [0, 1] symmetric and measurable}, 



by Theorem 5.6, and the points of Z F n is a subset of those, since they are all on the 



form t(F, U) for some stepfunction U with n steps. According to Lemma 5.4 any t(F, W) 
is within distance k^e of a point in the curvy zonotope Zp, n , and then according to 
Lemma [3.21 we are done. □ 

6. Conjectures about the limit object P F;00 . 

We have previously inscribed cyclic polytopes in the limit objects. We will now define 
another cyclic polytope and conjecture that a particular class of limit objects actually are 
cyclic polytopes. 

The vertices of P(K 2 ,K n )-,oo are given by the limits of complete fc-equipartite graphs 
according to results by Bollobas [HE]. It is not hard to see that P(k 2 ,k„);oq is a cyclic 
polytope, and we believe that this is true in a more general setting. 

For positive integers t\ < e% < . . . < e m define the tail s e : [0, 1] — > [0, l] m by 

*?(*) = 11(1 -js). 
j=l 

Proposition 6.1. The convex hull of any finite set of points on a tail is a cyclic polytope. 

Proof. Let + Consider the matrix whose columns are the points on the polytope, 
ordered by increasing x. The polynomial on row i is of degree i and has nonzero coefficients 
on all terms of lower degree. There is then a sequence of row operations that takes this 
matrix to a matrix with the same determinant but the polynomial on row i is a monomial 
of degree i. In this situation the argument from Zieglers book [2H] applies and the polytope 
is cyclic. 

The general case, with arbitrary is obtained from the case = i + 1 by projection: 
any tail is obtained from the tail with = i + 1 by removing some coordinates. The tail 
with Ci = i + 1 is generic and this is the property which ensure that points on it span 
cyclic polytopes, removing coordinates from a generic curve give a new generic curve. □ 

Conjecture 6.2. Let e\ < e2 < ■ ■ ■ < e m be positive integers and s e their tail. The convex 
hull of 1 and {s e (l/k) \ k = 1, 2, 3, . . .} is P(K ei ,K e2 ,...,K em y,oo- 

The conjectured vertex description of P(K e ,K e ,...,K em y,oo also gives a facet description 
since it's essentially a cyclic polytope. If we would chop off the vertex 1 from the convex 



hull described in Conjecture 6.2 with an hyperplane, then the remaining convex set would 



be an ordinary polytope. We believe this is true in the following general form. 
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Conjecture 6.3. For any vector F of graphs there is a positive integer m, such that for 
any e > 0, the limit object Pf ;00 can be chopped down to a polytope with a finite number 
of vertices, by using m hyperplanes to remove at most a volume e. 
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